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ABSTRACT 

We present a fast, accurate, robust and flexible method of accelerating parameter estimation. This 
algorithm, called Pico, can compute the CMB power spectrum and matter transfer function as well as 
any computationally expensive likelihoods in a few milliseconds. By removing these bottlenecks from 
parameter estimation codes, Pico decreases their computational time by 1 or 2 orders of magnitude. 
Pico has several important properties. First, it is extremely fast and accurate over a large volume of 
parameter space. Furthermore, its accuracy can continue to be improved by using a larger training set. 
This method is generalizable to an arbitrary number of cosmological parameters and to any range of 
^-values in multipole space. Pico is approximately 3000 times faster than CAMB for flat models, and 
approximately 2000 times faster then the WMAP 3 year likelihood code. In this paper, we demonstrate 
that using Pico to compute power spectra and likelihoods produces parameter posteriors that are very 
similar to those using CAMB and the official WMAP3 code, but in only a fraction of the time. Pico and 
an interface to CosmoMC are made publicly available at www.astro.uiuc.edu/~bwandelt/pico/. 

Subject headings: cosmic microwave background — cosmology: observations — methods: numerical 



1. INTRODUCTION 

With WMAP's second data relea se llHinshaw et all 
120061: iPaee et alJ l2006t iSpergel et aT]l2006|) . there is a 
wealth of new CMB data available to further constrain 
the cosmological parameters. The major computational 
burden in parameter estimation remains the calculation 
of the theoretical power spectrum for a large number 
of cosmological models as well as the likelihood based 
on these spectra. Generally the po wer spectrum is com- 
puted with codes such as CMB fast (|Seliak & Z aldarriaga 
11996ft or CAMB l|Lewis et al.ll2000D . which evolve the 
Boltzmann equation using a line of sight integration ap- 
proach. While this provides a 1 or 2 order of magnitude 
decrease in the computation time over the full Boltzmann 
codes, power spectrum calculations remain a bottleneck 
of par ameter estimation. Other softw are such as CMB- 
warp Ipimenez et al.ll200^) and DASh ( Kaolingha t et al.1 
l20f)i| have found ways to improve the efficiency of power 
spectrum calculations at the cost of a loss of accuracy 
against the full Boltzmann codes and/or placing restric- 
tions on the parameters which are available as input. 
In particula r, CMBwarp builds u pon the method in- 
troduced in l)Kosowskv et alJl2002j) . where a new set of 
nearly uncorrelated "physical" parameters were defined 
which have nearly independent effects on the power spec- 
trum. CMBwarp uses a modified polynomial fit whose 
coefficients are based on a fiducial model. It allows 
rapid calculation of the temperature (TT), E-mode po- 
larization (EE) and temperature-polarization (TE) cross 
power spectra. CMBwarp, however, requires the use of 
specific cosmological parameters and the accuracy of the 
computed power spectra quickly diminishes as one moves 
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away from the fiduci al model in paramet er space. An- 
other code, CMBFit l|Sandvik et alJl2004f) . attempts to 
avoid the need to compute the power spectrum by fitting 
the likelihood function. This idea is particularly impor- 
tant for the W MAP 3 year d ata llHinshaw et alj f2006: 
iPage et al.ll200l iSnereel et al]l2006fl whose likelihood is 
time consuming to compute. 

In this paper we introduce Pico, a computational tech- 
nique to accelerate both power spectrum and likelihood 
computations. This approach removes the 2 major bot- 
tlenecks in parameter estimation. While in a similar 
spirit as CMBwarp, and providing a similar speedup over 
CMBfast and CAMB, Pico has several important advan- 
tages over CMBwarp and DASh. First, it allows the 
calculation of power spectra from an arbitrary number 
of cosmological parameters and in any range of ^-values 
in multipole space. Because of this flexibility, it is easily 
incorporated into parameter estimation codes. Secondly, 
Pico allows the simultaneous computation of all scalar, 
tensor and lensed power spectra as well as the transfer 
functions. Pico provides more than an order of magni- 
tude increase in accuracy over CMBwarp, and about 2 
orders of magnitude increase in speed over DASh. Lastly, 
Pico is generic enough to allow the direct fitting of any 
likelihood functions. Due to the computational expense 
in computing the likelihood of certain experiments, e.g. 
WMAP3, any power spectrum acceleration scheme will 
at most provide a speed up of order 1 to 10 in parame- 
ter estimation. However, using Pico to also compute the 
likelihood results in speed ups of order 10 to 100. As an 
additional bonus, using Pico to compute the likelihood 
directly provides more accurate results then using it to 
fit the power spectra and computing the likelihood from 
these approximate spectra. This is important for current 
and next generation all-sky CMB data. Meanwhile, the 
power spectra computed by Pico are more than accu- 
rate enough for suborbital experiments with smaller sky 
coverage and coarser ^-resolution. 

This paper is organized as follows. We examine the 
CPU and memory requirements of Pico in sectional Sec- 
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tion presents several tests of the performance of Pico. 
This includes comparisons of power spectra computed 
using Pico and CAMB as well as results of parameter es- 
timation runs using Pico to compute the power spectra 
and the WMAP3 likelihood. In section 0] we summa- 
rize and discuss the future of Pico. The details of the 
algorithm used by Pico are presented in the appendix. 

2. CPU AND MEMORY REQUIREMENTS 

For a detailed description of our algorithm please re- 
fer to the appendix. The quantities that determine the 
CPU and memory requirements of the algorithm are the 
number of clusters n, the number of cosmological param- 
eters M x , the number of £- values and compressed ^-values 
M y and M' v and the order of the regression polynomial 
p. Each computation of the power spectra has (approxi- 
mately) no dependence on the number of clusters, since 
we only need to determine which cluster the input pa- 
rameters are in. This is found after n fast distance cal- 
culations. The power spectrum is then calculated using 
the polynomial in the cluster. This takes 2MM y compu- 
tations where M, the number of polynomial coefficients, 
is given by 

^-W~°((t)') fo '— °' 

After evaluating the polynomial, another M y M y calcu- 
lations are needed to uncompress the spectrum. It is 
thus possible to calculate the power spectrum with very 
few computations. For the 7 parameter case we examine 
in section 13.11 calculation of the power spectra takes ap- 
proximately 3 milliseconds on a 2 GHz Intel Pentium 
M processor. This is roughly 3000 times faster then 
CAMB for flat models and 15000 times faster for non- 
flat models. For parameter esti mation codes such as 
CosmoMC ijLewis fc Bridle! l2002|) . this speedup is sig- 
nificantly more than is necessary since evaluation of the 
likelihood quickly becomes the new bottleneck. How- 
ever, as we have noted, Pico removes this bottleneck 
as well by fitting the (computationally intensive) like- 
lihoods. Furthermore, other techniques such a s Gibbs 
sampling ijWandelt et al.ll200l IChu et alJl2005^l . which 
have quicker likelihood evaluations, continue to benefit 
from a significant speedup in computing the power spec- 
trum. 

The main memory use of the algorithm is in holding 
the nMM y regression coefficients. For the example we 
present in the next section using 4 th order polynomials 
in 7 parameters (M = 330) over 100 clusters and 60 com- 
pressed ^-values, this is approximately 15 MB of infor- 
mation. If fitting of the scalar and tensor modes out to 
£ = 3000 as well as the transfer function is included this 
number could increase by an order of magnitude. Even 
this can be accommodated by any modern personal com- 
puter. 

3. RESULTS 

In this section we provide several tests of the perfor- 
mance of Pico both in its ability to compute power spec- 
tra and in the results of parameter estimation runs. The 
first 2 tests compare the power spectra computed us- 
ing Pico with those using CAMB for 7 and 9 parameter 
models. Next we compare the results of a parameter esti- 
mation run using Pico and CAMB to compute the power 



spect rum and the 1 st year WMAP code l)Bennett et alJ 
2003) to compute the likelihood. Lastly, we compare pa- 
rameter estimation runs using Pico to compute the like- 
lihood with runs using CAMB and the official WMAP 3 
year likelihood code. In all cases Pico produces param- 
eter posteriors that are in good agreement with CAMB, 
both for the larger parameter space allowed by WMAP1 
and the higher precision required by WMAP 3. 

3.1. Power Spectrum Calculation for 7 Parameter 
Models 

Here we compare the performance of Pico with CAMB 
and CMBwarp. To generate our test set we be- 
gin with a converged Markov chain Monte Carlo run 
consisting of ~ 60000 cosmological models based on 



WMAP 1 st year iBennett et al.N2003t) and other CMB 
data, includ i ng CB T TPadin et alJl200l1ll. BOOME RANG 
(|Ruhl et alJ l200l. ACBAR llKuo et alJ l200l . VSA . 
llGrainge et al]l2003lk MAXI MA lHananv et alT fe OOO ) . 
DASI (|Halverson et alJ 12002]) and TOCO ijMiller et all 
1999). The varied parameters are the baryon density 
fib, the cold dark matter density fi c dm, the dark energy 
density Oa, Hubble's constant Ho, the scalar spectral in- 
dex n s , the optical depth since reionization r, and the 
normalization of the power spectra A s - Next we convert 
each point i n this space to the ph ysica l parameters in- 
troduc ed by Uimenez et al.l l)2004fl and iKosowskv et alJ 
( 2002) . In the physical parameter space there is signif- 
icantly less correlation in the set of points. We then 
calculate the mean and variance of this set, and use it 
to generate an 8-dimensional box in physical parameter 
space whose sides are of length 3er in each direction. Our 
test set consists of 10 4 models sampled uniformly from 
this box. That is, we in no way bias the test set by 
weighting points in parameter space based on their like- 
lihoods. The physical parameters are converted back to 
cosmological parameters and used to run CAMB to give 
the scalar TT, TE and EE power spectra. This set of 
10 4 models and their corresponding power spectra form 
our test set. 

The performance of Pico is shown in Figure . In this 
example, we ran Pico with 4 th order polynomials over 100 
clusters. We have plotted the error in units of the cosmic 
standard deviation as a function of the multipole I- value 
for the TT, TE and EE power spectra. The error in a 
single computed spectrum is defined as 
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where af v is the cosmic standard deviation. The three 
lines denote the average of this error over the test set and 
the error which bounds 95% and 99% of the test set (the 
95 th and 99 th percentiles) . The dashed black line in each 
plot denotes the expected uncertainty in data from the 
Planck satellite mission. Here we have assumed 65% of 
the sky will remain uncontaminated by foregrounds, and 
we have combined the 3 frequency bands from the LFI 
and the 3 lowest frequency bands from the HF I acco rding 
to the method described bv lZaldarriaga et all l)1997j) and 
IKinnevI <JT998). 

For 99% of the models in our test set, Pico is able 
to calculate the TT spectrum with an error less then 
0.3a cv , the TE spectrum with an error less then OAa 
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Fig. 1. — The above plots compare the performance of Pico with CAMB for 7 parameter models. The three lines denote the average 
error and the 95 th and 99 th percentiles over the 10 4 models in the test set. The dashed black line is the expected uncertainty from Planck 
data assuming f 3 k y = 0.65. The error is plotted in units of the cosmic standard deviation. 



and the EE spectrum to better then 0.7a cv for I out to 
1500. For the TE and EE spectra this excludes very low 
t where the magnitudes of the power spectra and cosmic 
variance become small. This is better than what will be 
achievable from even the Planck satellite mission. We 
note that the points with the largest error bars are near 
the edges of our training set and correspond to models 
that are highly disfavored even by CMB data alone. 

In Figure J2J), we have plotted the performance of 
CMBwarp against CAMB over the same test set. The 
four lines denote the average and 99 th percentile from 
CMBwarp as well as the average and 99 th percentile us- 
ing our code. We note that Pico is significantly more 
accurate over all I for the 3 power spectra. Pico gives 
more than an order of magnitude increase in accuracy 
over CMBwarp, while providing a similar decrease in the 
time required to compute a power spectrum as compared 
with CAMB. 

3.2. Power Spectrum Calculation for 9 Parameter 
Models 

As a second test of Pico, we calculate the scalar TT, TE 
and EE spectra as a function of 9 parameters. The train- 
ing set was formed using the 7 parameters as described 
in section 13. II in addition to the dark energy equation of 
state and the running of the scalar spectral index. The 
new parameters were drawn uniformly from the intervals 
[-1, -0.78] and [-0.085, 0] respectively. Figure © shows 
the performance compared with CAMB. In this example 
Pico was run with 4 th order polynomials over 100 clus- 
ters. While even at this level the accuracy is better then 
what will be achievable with Planck, one could continue 
to decrease the error by using a larger training set to 
allow the use of more clusters. 

3.3. Parameter Posteriors using Pico to compute Power 

Spectra 

We have incorporated Pico into the p ublicly available 
param eter estimation code CosmoMC l|Lewis fc Bridle] 
2002). The interface allows CosmoMC to use Pico to 
compute the theoretical power spectrum and transfer 
function as well as the WMAP3 likelihood whenever the 
parameters are within the range over which Pico's regres- 
sion coefficients are defined. For parameters outside this 
range, CosmoMC will continue to use CAMB to compute 
the power spectrum or the WMAP3 code to compute the 
likelihood. 

In this section we compare the posteriors over the pa- 
rameters computed by CosmoMC while using CAMB 



or Pico to compute the theoretical power spectrum. 
The likelihoods were computed usin g the WMAP I s * 
year data and likelihood function. ijVerde et all l2003t 
iHinshaw et~afl 120031: iKogut et all 12003ft . For this test 
we choose flat models and varied 6 parameters: flbh 2 , 
n c( j m /i 2 , 9, r, n s and the power spectrum amplitude _4 S . 
We ran CosmoMC using Pico for 500, 000 steps. CAMB 
was needed for less than 1% of the models. This took ap- 
proximately 15 hours. The posterior and mean likelihood 
over each parameter is shown in figures 10} and JHJ re- 
spectively. We have also plotted the posterior and mean 
likelihood from a 500,000 step run of CosmoMC using 
only CAMB, which took approximately 160 hours. The 
posteriors agree quite well, especially near the peaks. In 
every parameter except r, the mean of the posteriors dif- 
fer by less then 0.7%. For r, which is poorly constrained 
by this data set, the mean of the posteriors differ by 
3.7%. The errors in the likelihood evaluations from Pico 
are more apparent in Figure @ as the mean likelihood 
over the posterior depends on the square of the likeli- 
hood. Also, the likelihood is very sensitive to any corre- 
lated errors in the approximate power spectra computed 
by Pico. As will be shown in the following section, this 
problem is solved by using Pico to directly compute the 
likelihood. Even here, however, Pico agrees quite well 
with CAMB around the peak of the mean likelihood. 

In Figure © , we directly compare the accuracy of the 
likelihoods computed using power spectra from Pico and 
CMBwarp with CAMB. Using a uniformly sampled sub- 
set of the MCMC chain discussed in the previous para- 
graph, we first ordered the points by likelihood (black 
line). Next we recomputed the likelihood using power 
spectra from Pico (blue circles) and CMBwarp (red tri- 
angles) at each point. We see that the error in Pico is 
less then unity over two decades in likelihood. This is 
a significant improvement over CMBwarp. The dotted 
black lines are plus and minus one of the actual value of 
the log likelihood. 

3.4. Parameter Posteriors using Pico to compute the 
Likelihood 

As a final test, we ran CosmoMC using Pico to com- 
pute the WMAP3 likelihood. Figure compares the 
posteriors of this run with those using CAMB and the 
official WMAP 3 year likelihood code. The chains var- 
ied 7 parameters; these included the 6 parameters listed 
in section 13.31 as well as the dark energy equation of 
state w. The red, dashed line denotes the posterior using 
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FlG. 2. — The above plots compare the performance of Pico with CMBwarp. The four lines denote the average error and 99 th percentile 
for CMBwarp and the average error and 99 th percentile using Pico. The dashed black line is the expected uncertainty from Planck data. 
The error is plotted in units of the cosmic standard deviation. 
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FlG. 3. — The above plots compare the performance of Pico with CAMB for 9 parameter models. The three lines denote the average 
error and the 95 th and 99 th percentiles. The dashed black line is the expected uncertainty from Planck data. The error is plotted in units 
of the cosmic standard deviation. 



Pico to compute the power spectrum and the WMAP3 
likelihood. The black, solid line denotes the posterior 
using CAMB and the official WMAP3 likelihood code. 
By fitting both the likelihood and the power spectrum 
Pico provides a factor of 30 increase in speed. Further- 
more, with Pico, CosmoMC spends only about 1/5 of 
its time computing the power spectrum and likelihood, 
demonstrating that Pico has successfully removed these 
two bottlenecks from the parameter estimation process. 
Though it is not needed in this example, Pico also pro- 
vides the transfer function so that CAMB can compute 
the matter power spectrum and a%. 

4. CONCLUSION 

This paper provides a fast, accurate and robust method 
of calculating CMB power spectra and likelihood func- 
tions using local polynomial interpolation. A K-means 
clustering algorithm is used to partition the cosmologi- 
cal parameter space into local regions. Over each region 
we approximate the CMB power spectra as a polynomial 
in the cosmological parameters. This method, which we 
have named Pico, provides several orders of magnitude 
increase in speed over CAMB and the WMAP 3 year 
likelihood code, while proving accurate enough for the 
analysis of data from the current and next generation of 
cosmic microwave background experiments. The flexibil- 
ity of our algorithm enables it to handle any reasonable 
number of cosmological parameters. It has been gener- 
alized to allow the fast computation of any observables 
relevant to a particular data set, e.g. the transfer func- 
tions and the power spectrum of B-mode polarization 
anisotropies. Even higher order correlation functions, 
such as the reduced bispectrum, could be added. Pico 
is able to compute accurate power spectra over a large 



volume of parameter space consistent with the WMAP 
data. Furthermore, Pico's performance will only improve 
as the volume of space it must fit and the uncertainties 
in the parameters shrink. 

Pico is easily inserted into parameter estimation codes 
such as CosmoMC. It can be used to compute the power 
spectra, transfer function and the WMAP3 likelihood, 
resulting in a significant decrease in computational time. 
In fact, when Pico is used, CosmoMC spends 80% of 
its time on tasks other then computing the power spec- 
tra and likelihood. This time is spent generating ran- 
dom numbers, evaluating internal and derived parame- 
ters, etc. We envision that CAMB will only be needed 
for nonstandard cosmological models outside the scopes 
of our training sets. While it is likely possible to further 
improve the accuracy of our code by using less generic 
techniques, we have chosen to keep Pico as generic as 
possible to allow it to grow and adapt to the parameter 
estimation tasks of the next generation of experiments. 

We have made a Fortran 90 implementation of this 
algorithm publicly available. 5 Here the user will find 
regression coefficients to use the algorithm for various 
parameter sets, as well as short and straight forward in- 
structions for incorporating Pico into CosmoMC or using 
it as a front end for CAMB. The authors also welcome 
requests for regression coefficients for specific combina- 
tions and ranges of parameters. Enabling Pico on a new 
parameter set simply involves running CAMB to gener- 
ate a new training set. 

We would like to thank Lloyd Knox, Antony Lewis 
and Max Tegmark for useful discussion. This work 

5 http: / /www.astro. uiuc.edu/~bwandelt/pico/ 
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Fig. 4. — The 1-D posterior constraints on the cosmological parameters using runs of CosmoMC with Pico and CAMB. The likelihoods 
were computed using the 1 st year WMAP data and likelihood code based on the Pico power spectra (black, solid line) and the CAMB power 
spectra (red, dashed line). Using Pico decreased the time to compute the power spectra by a factor of 3000, and the overall computational 
time by a factor of 10, while providing very similar posteriors. 
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Fig. 5. — The mean likelihoods over the cosmological parameters using runs of CosmoMC with Pico and CAMB. The likelihoods were 
computed using the 1 st year WMAP data and likelihood code based on the Pico power spectra (black, solid line) and the CAMB power 
spectra (red, dashed line). Using Pico decreased the overall computational time by a factor of 10, while accurately fitting the peaks of the 
mean likelihoods. The error in the tails is due to correlated error in the Pico computed power spectra. 

was partially funded by NSF grant AST 05-07676, Teragrid (www.teragrid.org) Xeon cluster tungsten 
by NASA contract JPL1236748, by the National (login-w.ncsa.teragrid.org) and Itanium 2 cluster 
Computational Science Alliance under AST300029N mercury (tg-login. ncsa.teragrid.org) at NCSA, 
and by the University of Illinois. We utilized the 

APPENDIX 
ALGORITHM 

This appendix presents the basic algorithm Pico uses to calculate the angular power spectra. It consists of three major 
pieces, the compression of the training set power spectra, the clustering of the training set cosmological parameters, 
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Fig. 7. — The posteriors for 7 parameter flat models using the WMAP 3 year data. The solid (black) line denotes the posterior using 
Pico to compute both the power spectrum and the likelihood, while the dashed (red) line is the posterior using CAMB and the WMAP 3 
year likelihood. The Hubble constant Ho and the dark energy density are derived from the other 7 parameters. Using Pico to compute 
the likelihood and power spectra provides a factor of 30 increase in the speed of the parameter estimation code. The improvement over 
Figure|3]is the result of fitting the likelihood directly with Pico. 



and the calculation of the local regression polynomials. For clarity reasons, we will discuss the latter of these pieces 
first. 

Polynomial Interpolation 

Consider a training set of N vectors of cosmological parameters Xj , each of dimension J\f x and their corresponding 
power spectrum y^, each of dimension J\f y . The number of cosmological parameters and power spectrum values is 
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arbitrary. In general, y can be constructed by concatenating all the scalar, tensor and lensed power spectra as well as 
the transfer functions into a single vector living in R-^* . 

Our goal is to interpolate the function f that maps the cosmological parameters x into their power spectra y, i.e. 
y = f (x). This function is an J\f x dimensional manifold that is naturally embedded in an (J\f x + J\f y ) dimensional 
Euclidean space. Our method is to approximate this mapping using a polynomial in the cosmological parameters. The 
fc th component of y is then approximated as a p th order polynomial in the M x cosmological parameters: 

Ilk ^ ' Oii 1 i 2 ...i p Xi 1 Xi 2 ■ ■ 'Xi p . 

ii>i 2 >->i p 

The coefficients a^-.-i are chosen to minimize the squared error over the training set 

N 

i? 2 =^(y(x,)- yj ) 2 



This leads to a regression matrix which can be inverted to find the polynomial coefficients. 

We have generalized this algorithm to include arbitrary fitting functions, for example Chebyshev or Legendre poly- 
nomials. Our tests show that Pico performs at a similar level using these functions as using standard polynomials. 

Clustering 

The interpolation method described above fails to accurately model the power spectra over the entire parameter 
space. To remedy this, we would like to fit polynomials on disjoint local regions of the full parameter space, limiting 
the variation in the power spectra over the individual regions. While naively griding this large dimensional space 
would be computationally prohibitive, 

clustering avoids the "curse of dimensionality" by using the points in the training set to naturally divide the parameter 
space into smaller regions. A polynomial is used within each cluster to provide a local approximation of the power 
spectra within the cluster. It is then only necessary to ensure that each cluster has a sufficient number of training 
set points to accu rately calculate the regression coefficients. We implement clustering using the K-means algorithm 
l)MacQueenlll967|) . which we found adequate for our purposes. 

Ideally all clusters encompass volumes of parameter space over which the power spectra vary roughly equally. For 
example, we would like to take into account the fact that there is a roughly equal variation of the power spectra from 
a change in the baryon density of ~ 0.01 as from a change in the cold dark matter density of ~ 0.1. This is achieved 
by sphering the training set prior to clustering. A sphered data set is defined to have a covariance matrix equal to the 
identity. If C xx denotes the covariance matrix of the cosmological parameters that make up the training set, then the 
set is sphered by constructing the matrix MU such that 

MUC XX U T M T = ME XX M = I. 

Here U is the orthogonal matrix that diagonalizes C xx , M is a diagonal matrix whose entries are the inverse of the 
square root of the eigenvalues of C xx , and the diagonal matrix E xx contains the eigenvalues of C xx . The matrix MU 
is used to map the training set into a new sphered basis. In this basis the parameter space is clustered using the 
K-means algorithm and the standard Euclidean distance. Since in the sphered space, the power spectra corresponding 
to the parameters will vary equally in all directions, the clusters will retain this desired property when mapped back 
to the unsphered basis. 

In Figure i|A8(l we demonstrate the results of using the K-means clustering algorithm on a 2 dimensional parameter 
space, Af x = 2. The different point types distinguish the members of each of the 4 clusters. Note the difference in 
the arrangement of the clusters when the data is sphered prior to clustering. The sphered data ignores the scale and 
correlations of the parameters, giving clusters over which the power spectra vary roughly equally. 

Power spectrum compression 

The efficiency of the algor ithm can be improved by using Karhunen-Loeve compression UKaxhunenl 1194(3 iLoTvel 
Il955t iTegmark fc Bunnlll995|) to transform the power spectra subspace of the training set to a new, lower dimensional 
space. We begin with a training set of 10 4 power spectra generated as in 13.11 The training set consists of vectors 
yj formed from concatenating the scalar TT, TE, and EE power spectra evaluated at the 45 "usual" ^-values used 
by CMBfast out to I = 1500. The "usual" ^-values are the ones actually evolved by CMBfast or CAMB; the power 
spectrum is interpolated at the intermediary fs. After constructing the covariance matrix of the power spectra C yy , 
an eigen-decomposition gives a transformation matrix V having the property 

VCyyV T = Eyy, 

where E yy is a diagonal matrix containing the eigenvalues of C yy . In Figure (|A9|) . we have plotted the 30 largest 
eigenvalues of C yy . The fact that these eigenvalues vary over a large range indicates that some redundancy remains 
in the components of y. By choosing a new basis nearly all of the information in the 3 power spectra can be stored in 
significantly fewer coefficients. The compression matrix is formed by dropping the rows of V, which are the eigenvectors 
of Cyy, that have small eigenvalues (relative to the largest). Then V is a mapping from a 135 dimensional space to a 



Fig. A8. — An example of sphering and K-means clustering using a 2 parameter training set. The top left figure is the original data 
set. In the top right figure the training set has been clustered into 4 regions. Since the variation of the parameters is significantly larger 
in the horizontal direction, the clustering simply divides along this axis. However, the power spectrum will vary equally in both directions 
of parameter space, so there is a much larger variation in the power spectrum along the vertical direction of each cluster. This property 
can be avoided by sphering the parameter space prior to clustering. The bottom left figure shows the data set after sphering and then 
clustering. In this basis the parameters and the power spectrum vary equally in all directions. The bottom right figure shows the data set 
back in the original basis. Now there is no bias in the clusters based on the scale or correlations of the parameters. The clusters retain the 
property that the power spectrum will vary equally across each cluster. 




Fig. A9. — Plot of the 30 largest eigenvalues of the covariance matrix of the training set power spectra. There are 135 total eigenvalues. 
The fact that only a few of the eigenvalues dominate indicates that the power spectra can be compressed significantly by rotating into a 
new basis and projecting out directions corresponding to small eigenvalues. In the examples below we keep the directions corresponding to 
the 60 largest eigenvalues. 



much smaller (~ 60 dimensional) space. Since a set of polynomial regression coefficients is needed for each component 
of y, this compression algorithm provides a significant reduction in the computation time and memory requirements 
of the algori thm. 

In Figure IjAf 01) we plot the error accrued due to the compression of the power spectra. That is, we computed V, 
truncated the specified number of rows and calculated 

y/ = V T Vy 

over the 10 4 models, computed using CAMB, that we will use as our test set in section ©. The dimensionless average 
error in the plots is defined as the mean absolute deviation: 

where Ce and Cf AMB denote the individual power spectra that make up y/ and y respectively. The brackets denote 
averaging over the f0 4 models and af Y is the cosmic standard deviation computed using C^ AMB . Recall that the 
cosmic standard deviation of the TT, TE and EE spectra are given by 

CV.TT / 2 TT 
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Fig. A10. — Plots of the error accrued due to compressing the power spectra as a function of the number of compressed Ps. The 3 lines 
denote the average error, the 99-percentile error bar, and the worst error over the 10 4 models in the test set. The worst error is the largest 
deviation from CAMB at any I in any member of the test set. For nearly all of the models, only ~ 60 compressed Ps are needed of the 135 
possible to maintain sufficient accuracy in the compressed power spectra. 
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